HPCTOOLKIT: tools for performance analysis of optimized parallel programs

نویسندگان

  • Laksono Adhianto
  • S. Banerjee
  • Michael W. Fagan
  • Mark Krentel
  • Gabriel Marin
  • John M. Mellor-Crummey
  • Nathan R. Tallent
چکیده

HPCTOOLKIT is an integrated suite of tools that supports measurement, analysis, attribution, and presentation of application performance for both sequential and parallel programs. HPCTOOLKIT can pinpoint and quantify scalability bottlenecks in fully-optimized parallel programs with a measurement overhead of only a few percent. Recently, new capabilities were added to HPCTOOLKIT for collecting call path profiles for fully-optimized codes without any compiler support, pinpointing and quantifying bottlenecks in multithreaded programs, exploring performance information and source code using a new user interface, and displaying hierarchical space-time diagrams based on traces of asynchronous call stack samples. This paper provides an overview of HPCTOOLKIT and illustrates its utility for performance analysis of parallel applications.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

HPCToolkit: Performance Measurement and Analysis for Supercomputers with Node-level Parallelism

Today’s largest supercomputers consist of tens of thousands of nodes equipped with one or more multi-core microprocessors. A challenge for performance tools is that bottlenecks in programs executing on these systems may arise from a myriad of causes. To address this problem, Rice University is developing HPCTOOLKIT an integrated suite of tools that supports sampling-based measurement, analysis,...

متن کامل

A Methodology for Accurate, Effective and Scalable Performance Analysis of Application Programs

We describe a unique and comprehensive methodology for accurately measuring and effectively analyzing the performance of an application’s execution. This methodology is 1) accurate, because it assiduously avoids systematic measurement error (such as that introduced by instrumentation); 2) effective, because it associates useful performance metrics (such as memory bandwidth) with important sourc...

متن کامل

Application Performance Profiling on the Cray XD1 using HPCToolkit∗

HPCToolkit is an open-source suite of multi-platform tools for profile-based performance analysis of sequential and parallel applications. The toolkit consists of components for collecting performance measurements of fully-optimized executables without adding instrumentation, analyzing application binaries to understand the structure of optimized code, correlating measurements with program stru...

متن کامل

LACSI Impact on ASC Projects at LANL

These tools support effective analysis of the node performance of large-scale scientific applications consisting of thousands of procedures, hundreds of thousands of lines of code, and external (possibly binary-only) libraries. The toolkit is designed to work directly with optimized application binaries. By doing so, the toolkit is language independent and it avoids the need for manual instrume...

متن کامل

Performance Analysis and Optimization of a Hybrid Seismic Imaging Application

Applications to process seismic data are computationally expensive and, therefore, employ scalable parallel systems to produce timely results. Here we describe our experiences of using performance analysis tools to gain insight into an MPI+OpenMP code developed by Shell that performs Reverse Time Migration on a cluster to produce models of the subsurface. Tuning MPI+OpenMP programs for modern p...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Concurrency and Computation: Practice and Experience

دوره 22  شماره 

صفحات  -

تاریخ انتشار 2010